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We consider evaluation of proper posterior distributions obtained 
from improper prior distributions. Our context is estimating a bounded 
function of a parameter when the loss is quadratic. If the posterior 
mean of (f> is admissible for all bounded (f>, the posterior is strongly ad- 
missible. We give sufficient conditions for strong admissibility. These 
conditions involve the recurrence of a Markov chain associated with 
the estimation problem. We develop general sufficient conditions for 
recurrence of general state space Markov chains that are also of in- 
dependent interest. Our main example concerns the p-dimensional 
multivariate normal distribution with mean vector when the prior 
distribution has the form g(\\6\\ 2 ) d6 on the parameter space R p . Con- 
ditions on g for strong admissibility of the posterior are provided. 

1. Introduction. In many standard parametric settings, the use of im- 
proper prior distributions to produce inferential proposals is well established. 
A variety of formal rules have been proposed to justify particular improper 
priors. This is especially true in invariant problems. The survey article of 
Kass and Wasserman [15] provides an excellent overview of many issues that 
arise in selecting and using improper priors, including decision theoretic con- 
siderations. Indeed, decision theory provides an appealing framework for the 
evaluation of improper priors via the (proper) posterior distributions they 
produce. A particular decision theoretic approach was described in [7], but 
subsequent suggestions have had more appeal. One of these involves a notion 
of "strong admissibility" and is the basis of the approach below. This notion 
was introduced in [8] and was called P-admissibility in [14]. 
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The idea behind strong admissibility is the following. Given a sample 
space (X, £>), suppose a random object X G X is to be observed and assume 
that the distribution of X is an element of the parametric model {P(-\8)\8 G 
G}. Consider a cr-finite improper prior distribution v and define the marginal 
measure M on (X,B) by 

M(B) = ( P(B\8)u(d8), BeB. 
Je 

Technical issues such as measurability, existence of integrals, etc. are treated 
carefully in the next section. When the measure M is cr-finite, a proper 
posterior distribution Q(d8\x) exists and satisfies 

P(dx\8)v{d8) = Q{d8\x)M{dx), 

in the sense that the joint measures on each side of the equality sign are 
equal. Interpreting Q(-\x) as summarizing knowledge about 8 after seeing 
X = x, the probability measure Q(-\x) on G can now be used to solve decision 
problems. Namely, one integrates a loss function with respect to Q(-\x) and 
then picks an action (depending on x) to minimize the expected posterior 
loss. This procedure is ordinarily called the formal Bayes method of solving 
a decision problem. 

As an example, let <f){8) be a bounded real-valued function on Q and 
consider a quadratic loss function 

(1) L(a,8) = (a-(p{8)) 2 , art. 

The posterior expected loss is minimized by the posterior mean 

(2) 4>(x)= f </>(0)Q{d0\x), 

Je 

and 4> is a formal Bayes estimator of c/>. Given any estimator t(x) of 4>(8), 
the risk function of t is 

(3) r{t,9)= f (t(x)-cf>(0)) 2 P(dx\e), 

Jx 

and admissibility of estimators is assessed in terms of the risk function. 

The formal posterior Q(d8\x) is strongly admissible if, for every bounded 
measurable function <p, the formal Bayes estimator (ft is admissible (this is 
defined more carefully in the next section). Loosely speaking, strong admis- 
sibility is regarded as an endorsement of Q(-\x) and, hence, of the improper 
prior z/, for use in making inferences about 8 after seeing X = x. 

A sufficient condition for strong admissibility is the recurrence of the 
Markov chain with state space G and transition function 

(4) R(C\9)= [ Q{C\x)P{dx\8). 

Jx 
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In words, (4) is the expectation of the formal posterior when X is sampled 
from P{-\9). There is more than one notion of recurrence that is relevant for 
strong admissibility. A discussion of this issue is given in the next section. 
Here is an example that illustrates the issues that motivated our research. 

Example 1.1. Suppose X has a p-dimensional multivariate normal dis- 
tribution with mean vector 9 £M P and covariance matrix the p x p identity, 
I p . This we write as X ~ N p (9,I p ). Consider Lebesgue measure as the im- 
proper prior distribution: u{d9) = d9 on = W . Standard calculations show 
that the formal posterior Q(d9\x) is a N p (x,I p ) distribution on W. Further, 
the transition function R(-\9) is a N P (9,2I P ) distribution on W p . Therefore, 
the one step transition of the Markov chain in this example can be described 
as follows: Given the chain is at 9 E M p , the next state of the chain is 9 + V, 
where V is N p (0,2I p ). The chain is thus a random walk on W. For p = 1 
or p = 2, the chain is recurrent (see [5] for p = 1 and [20], Chapter 3, for 
p = 2). But for p > 3, the chain is transient ([12], page 579). Therefore, for 
p = 1, 2, the formal posterior Q(-\x) is strongly admissible, but for p > 3, the 
recurrence argument fails. Of course, one suspects that, for p > 3, Q(-\x) is 
in fact not strongly admissible because of the existence of the James-Stein 
estimator for 9, but a rigorous proof of a "not strongly admissible" assertion 
for Q(-\x) is not known to us. 

Now, focus on the case of p > 3. Since the chain induced by the normal 
model and the improper prior li d9" is transient, it seems reasonable to look 
at a somewhat broader class of priors of the form 

(5) v(d9)=g (\\9\\ 2 )d9, 

where || • || denotes the Euclidean norm on W. It is natural to seek conditions 
on the function g$ so that the induced Markov chain is recurrent. It is this 
and related issues that give rise to much of the new material herein. The 
primary difficulties in addressing this issue are as follows: 

(i) The transition function cannot be computed explicitly, so the corre- 
sponding Markov chain is difficult to study. 

(ii) Even for special and rather simple <?o' s > techniques for establishing 
the recurrence of the induced chain are not available. 

This completes our initial discussion of this example. 

Here is an outline of the material in this paper. The next section contains 
notation and assumptions, a careful definition of strong admissibility, and a 
discussion of the Markov chain introduced above. In particular, two different 
notions of recurrence are described (neither of which involves the notion 
of irreducibility) , and their relation to strong admissibility is emphasized. 
Results in [8, 9, 10] underlie much of this material. 
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Section 3 contains the first of two main results that are of use in establish- 
ing strong admissibility. This result was conjectured in [8], pages 1166-1167. 
The idea, reminiscent of hierarchical prior proposals, is to give conditions un- 
der which the improper prior distribution u{dO) on © has a "disintegration" 
into n(d0\(3)s(dp). Here vr(-|/3) is a probability measure for each f3 and s is 
a cr-finite measure on a set A (typically, much simpler than O). Sufficient 
conditions for strong admissibility are then based on an induced Markov 
chain whose state space is A rather than G. The arguments in Section 3 rely 
on results involving the Dirichlet forms associated with the Markov chains 
under consideration (see [10]). 

Our second main result, presented in Section 4, concerns sufficient con- 
ditions for the recurrence of a Markov chain with a general state space (a 
Polish space) Y . The results in Section 4 are an extension and refinement 
of results in [16], which are in turn an extension of the early work in [17]. 
When Y = [0,oo), sufficient conditions for the recurrence of a set [0,m) for 
m sufficiently large are expressed in terms of the first three moments of the 
increments of the chain. 

In brief, recurrence of the Markov chain with transition function R implies 
strong admissibility. This paper provides two results that simplify the task of 
establishing recurrence. Application of the results to Example 1.1 illustrates 
a nontrivial case that justifies the new methodology in Sections 3 and 4. 
In particular, these methods are applied in Section 5 to the problem of 
Example 1.1 when the prior has the form given in (5). Conditions on the 
function go (which are necessarily dimension dependent) that imply strong 
admissibility are discussed in detail. Our treatment of Example 1.1 is a 
rigorous development of work begun in [16]. 

2. Background. The purpose of this section is to provide assumptions 
and rigorous statements that underlie the connection between strong ad- 
missibility and recurrence. The discussion here is an abbreviated version of 
some material in [10] and we assume the reader is somewhat familiar with 
that material. It is useful to keep Example 1.1 in mind. 

2.1. Model, prior and posterior. The sample space (X,B) is assumed to 
consist of a Polish space X (a complete separable metric space) coupled 
with the Borel a-algebra B. The parameter space (0,C) is also assumed to 
be Polish with C the Borel cr-algebra. A statistical model {P(-\9)\9 6 0} for 
an observable quantity X £ X specifies the modeling assumption. We assume 
the P(-\-) are Markov transition functions — that is, P(-\9) is a probability 
measure on B for each 9, and for each B £ B, P(B\-) is C-measurable. 

Now let is(d9) be a cr-finite measure on C, and for each B G B, consider 
the marginal measure given by 



(6) 
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Throughout, the marginal measure M is assumed to be cr-finite. Under this 
assumption, there is a Markov transition function Q(-\x) that satisfies 

(7) P{dx\e)v{d6) = Q{d6\x)M(dx). 

This equation means that the two joint measures on X x given by (7) 
agree. For a discussion of the existence and uniqueness (a.e. M) of Q(-\x); 
see [7]. Of course, Q(-\x) is called the formal posterior distribution of 9 given 
X = x. 

2.2. Strong admissibility and recurrence. As in Section 1, let be a 
bounded, measurable, real-valued function defined on and consider the 
problem of estimating <j){9) when the loss is (1). Then 4>{x) given in (2) is 
the formal Bayes estimator obtained from the formal posterior Q{-\x). Using 
the risk function defined in (3), here is an appropriate notion of admissibility, 
due to C. Stein, for our setting. 

Definition 2.1. The estimator to(x) for 4>(9) is almost-v -admissible (a- 
if for each estimator t that satisfies r(t,9) < r(to,9) for all 9, the set 
{9\r(t,9) < r(to,9)} has ^-measure zero. 

Definition 2.2. The improper prior v, or equivalently, the formal pos- 
terior Q(-\x), is strongly admissible if for each bounded measurable </>, the 
estimator <j> is a-z^-a. 

The boundedness assumption on (j) greatly simplifies the technical issues 
surrounding our discussion. For some parallel results regarding the estima- 
tion of unbounded functions, see [9]. 

In this section two notions of recurrence are useful. To describe these, 
recall the transition function given in (4). This transition function deter- 
mines a Markov chain W = (Wo, Wi, . . .) with each W{ € 6, i = 0, 1, 2, 

The path space of W is 0°° and given that Wo = wo, the probability distri- 
bution of W is denoted by P wo . In some situations we will call W the P-v 
chain to emphasize its dependence on the model and the prior. 

A measurable subset C C is u-proper if < v(C) < oo. Let C be a 
^-proper set and consider the stopping time tq defined on 0°° by 

f oo, if W n iC for ah n>l, 

l_ smallest n > 1 such that W n G C, otherwise. 

Also, let E c = {w\t c {w) < oo} C 0°°. 

Definition 2.3. The i^-proper set C is locally-^ -recurrent (l-z/-r) if the 
set {wo E C\P Wo (Ec) < 1} has i^-measure zero. The ^-proper set C is u- 
recurrent if the set {wo\P m (Ec) < 1} has ^-measure zero. 
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Definition 2.4. The Markov chain W is locally-v -recurrent (l-i/-r) if 
every ^-proper set is l-z/-r. 

The following basic result was established in [8]. 

Theorem 2.1. If the Markov chain W is l-v-r, then Q(-\x) is strongly 
admissible. 

The above theorem is often difficult to apply since every z^-proper C must 
be shown to be l-v-r. The following result ([10], Theorem 3.1), eases the 
burden somewhat. 

Theorem 2.2. The following are equivalent: (i) the chain W is l-v-r 
and (ii) there is an increasing sequence of v-proper sets Ci, i= 1,2,..., 
such that Ci — ► and each Ci is l-v-r. 

The next result facilitates the application of the recurrence ideas to the 
admissibility issue. It is sometimes called the "one-set criterion." See [10] 
for a proof. 

Theorem 2.3. Let C* be a v-proper set and suppose C* is v-recurrent. 
Then every v-proper set C is l-v-r. 

Section 4 is devoted to the problem of finding a single recurrent set so 
that the above theorem can be applied to nontrivial examples. 

The Markov chain one-set technique of Section 4 and the dimension re- 
duction method of Section 3 are the basic contributions of this paper. The 
dimension reduction method relies on a rather deep connection between l-v-r 
and the behavior of the Dirichlet form associated with the Markov chain W 
generated by the transition function R(d9\rj) and the measure v. 

In what follows, L^iy) denotes the linear space of i/-square integrable 
functions. Let h G L2(v), then the quadratic form 



is a Dirichlet form (see [6] and [13] for background material). The relevance 
of A for statistical decision problems with quadratic loss is discussed in [10]. 
Given a improper set C, let Iq denote the indicator function of C and let 



Here is a basic theorem established in [8] that relates the Dirichlet form A 
to questions regarding admissibility. 



(8) 



A ( h ) = l J f(h(d)-h( V )) 2 R(dB\r,)u(dr,) 



(9) 



V(C) = {h\heL 2 (v),h bounded, h>I c }. 



EVALUATION OF POSTERIOR DISTRIBUTIONS 



7 



Theorem 2.4. Fix a u-proper set C . The following are equivalent: (i) 
C is l-v-r and (ii) in£heV(C) = 0. 

Theorems 2.2, 2.3 and 2.4 are used in the next section. 

3. A reduction argument. Again consider a model {P(dx\9) \ 9 G 0}, a 
o"-finite prior v(d9) with M(dx) assumed to be cr-finite. Hence, there is a 
Markov kernel Q(d6\x), a formal posterior for 9 given x, so that (7) holds. 
The transition function R(d9\rj) defines the P-v Markov chain W and the 
related symmetric measure 

R{d9\n)u(dr,) = [ Q(d9\x)Q(dr,\x)M(dx). 
Jx 

This in turn defines the Dirichlet form A given in (8). Next, consider a 
measurable mapping t from (0,C) to ([0, oo), £>i). The map t induces a 
measure s on the Borel subsets B\ of [0, oo) given by s(A) = u(t~ 1 (A)). 

Assumption 3.1. There is a sequence of disjoint Borel sets Ai C [0, oo) 
for i = 1,2,3,... such that 

oo 

[J Ai = [0, oo) and Bi = t^ 1 (Ai) C is zA-proper. 

i=l 

Theorem 3.1. Suppose Assumption 3.1 holds. Then there is a Markov 
transition function w(d9\a) on C x [0,oo) such that 

(10) v(d9)=Tr{d9\a)s{da). 

Equation (10) means that for all measurable nonnegative functions fi(9) and 
f2(t(9)) defined on and [0,oo) respectively, we have 

(11) jT f 2 {t{9))h{9)v{d9) = J™ f 2 (a) ^ /i(e)7r(d(?|o)) s(da). 

If v is a finite measure, then the assertion of Theorem 3.1 is nothing 
more than the existence of a conditional distribution [on given t{9) = a] 
ir(d9\a), where s is the marginal distribution of t(9). When v is allowed to 
be cr-finite, Assumption 3.1 allows one to argue separately on the spaces 
Bi C and Ai C [0, oo) using standard techniques; see [18], pages 43 to 52. 
The details are omitted. 

Example 3.1. Consider u(d9) = g (\\9\\ 2 ) d9 on9 = r with p>l. As- 
sume the measurable function go is non-negative and v is cr-finite. Consider 
the mapping t on defined by t(9) = \\9\\ 2 G [0,oo). Let it{d9\0) denote the 
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uniform probability distribution on {9\ \\9\\ 2 = 0} with the obvious extension 
of 7r(- to 0. A routine argument shows that u{d9) = ir(d0\(3)s(d(3), where 

(12) s(d(3)= ^'fl UPW'^W on[0,oo). 

r(p/2) 

Remark 3.1. Without Assumption 3.1, Theorem 3.1 is typically false. 
To see part of the difficulty, let = [0, oo) x [0, oo), let v be Lebesgue mea- 
sure on and let t(#i,#2) = 0\ £ [0, oo). It is not too hard to show that a 
ir(d8\a) satisfying (11) does not exist. 

For the remainder of this section, Assumption 3.1 holds, so the conclusion 
of Theorem 3.1 holds. Now, introduce the new model 

P(dx\a) = { P{dx\e)ir(d6\a), ae[0,oo). 
Jo 

The "prior" s(da) is cr-finite (this follows from Assumption 3.1) and the 
marginal on X is the cr-finite measure M(dx). That M{dx) is still the 
marginal follows from 

/*oo r 

P(dx\a)s(da) = / / P(dx\e)ir(d9\a)s(da) 
Jo Je 

P(dx\6)u(d0) = M(dx). 

e 

Thus, there is a formal posterior Q{da\x) that satisfies 

(13) P{dx\a)s(da) = Q{da\x)M(dx). 

The model P(dx\a) together with s determines a Markov chain, the P-s 
chain, with transition function 



R{da\b) = I Q(da\x)P(dx\b) 

and Dirichlet form 



oo roo 



A(h) = i / / (h(a) - h(b)YR{da\b)s{db). 
Jo Jo 

The rest of this section is devoted to proving that if the P-s chain is locally- 
s-recurrent (1-s-r), then the P-v chain is 1-z^-r. This shows that the 1-s-r 
of the P-s chain implies strong admissibility for the original problem (see 
Theorem 2.1). To describe our first result, let Qq be the Markov kernel on 
£>i x X defined by 

(14) Q (A\x)=Q(t-\A)\x). 
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Theorem 3.2. The Markov kernel Qq serves as a version of Q in (13). 

Proof. Consider nonnegative measurable functions v on X and on 
[0,oo). Then using the definition of Qq yields 



e Jx 



x Jo 

00 



v(x)i/j(a)P(dx\a)s(da) = / / v(x)i/j(a)P(dx\e)ir(dO\a)s(da) 
Jx Jo JxJe 

v(x)ij)(t{6))P{dx\6)v(d6) 
v(x)^(t(6))Q{d6\x)M{dx) 
v(x)ijj(a)Qo(da\x)M(dx). 1-1 

x Jo 

Theorem 3.3. Consider h G L 2 {s) and define h* on 6 by h*{9) = 
h(t(6)). Then h* G L 2 (v) and A(/i) = A(h*), where A and A are the Dirich- 
let forms for the P-s chain and the P-v chain respectively. 

Proof. That h* G L,2(v) is obvious. Using the definition of h* , Qq, and 
Theorems 3.1 and 3.2, we have 

A(h*) = l[ [ [ (h*(6)-h*(r])) 2 Q(de\x)Q(d V \x)M(dx) 
JeJeJx 

= \\ I I (h(t(8))-h(t(r]))) 2 Q(de\x)Q(dn\x)M(dx) 
JeJeJx 

r (h(a) - h(b)) 2 Q (da\x)Qo(db\x)M(dx) = A(h). n 

Theorem 3.4. If the P-s chain is l-s-r, then the P-v chain is l-v-r. 

Proof. Let A be an s-proper subset of [0,oo). By Theorem 2.4, 

^inf A(/t) = 0, 
h&V(A) 

where V{-) is the analog of V(-) [defined by (9)] for the P-s problem. Now, let 
{Di} be an increasing sequence of s-proper subsets of [0,oo) such that Di /* 
[0,cxo). Setting Ei = t~ 1 (Di) for i = 1,2,..., we see that v{Ei) = s(P>i) G 
(0,oo) and Ei /* 0. By Theorem 2.2, it is sufficient to show that each Ei is 
l-v-r. But using Theorem 2.4 shows that it is sufficient to verify that 



00 roo 



(15) 



inf A(h) = 0. 

h&V{E t ) 
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To show (15), let e > be given and select h G V(Di) so that A(/i) < e. 
Then set h* (9) = h{t{6)) . Clearly, h* G V{Ek) and by Theorem 3.3, A(h*) = 
A(h) < e. Thus, (15) holds. □ 

Remark 3.2. The choice of [0, oo) for the range of t is purely for sim- 
plicity and the direct application to Example 1.1. Any other choice of Polish 
space for the range of t yields the same results as long as Assumption 3.1 
holds. A convenient choice for t and its range is dependent upon the appli- 
cation. 

4. Recurrence of Markov chains. In this section we discuss the recur- 
rence of a discrete time Markov chain with values in a Polish space X. The 
a- algebra generated by the open sets of X is denoted by B. The notation 
and setting in this section are independent of that in the earlier sections. In- 
deed, the material here may be of independent interest since some nontrivial 
generalizations of results in [17] are given below. 

Consider a discrete time Markov chain X = (Xo,Xi, . . .) with state space 

X so each X\ is an element of X, i = 0, 1, Therefore, X takes values 

in the product space X°° which is equipped with the natural product a- 
algebra B°°. The one-step transition function of the chain, assumed to be a 
Markov kernel on B x X, is denoted by T(-\y) for y E X. Given xq G X, let 
P(-\xo) denote the distribution of X 6 X°° when Xq = xq. Thus, P(-\xq) is 
a probability measure on B°°. 

Now, let C be a Borel subset of X . Define the stopping time 

f oo, if X n 4 C for all n > 1, 

[ smallest n > 1 such that X n G C, otherwise 

and let E c = {w G X°°\t c (w) < oo}. 

Definition 4.1. The set C is P(-\x$) -recurrent if P(Ec\x ) = 1. The 
set C is recurrent if C is P{-\x$) -recur rent for all xq G X . 

Remark 4.1. If C is recurrent and z/-proper, in the language of Sec- 
tion 2, then C is z^-recurrent. 

Proposition 4.1. If C is P{-\xq) -recurrent for all x £ C c , then C is 
recurrent. 

Proof. This is a standard Markov chain argument which is omitted. 

□ 
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4.1. A condition for recurrence. Let / be a Borel measurable function 
denned on X with values in [0, oo). Define a sequence of random variables 



where A denotes the minimum. Obviously, Y{ = f{Xi) for i = 0,1. The proof 
of the following is omitted. 

Lemma 4.1. With Ec as defined above, if w £ Ec, then the sequence 
{Y n {w) | n = 0, 1,2, . . .} converges to a finite limit, namely, f(X Tc r w \(w)). 

Here is an important structural condition on / and the chain X. 

Assumption 4.1. Given xq g X, 



Theorem 4.1. Let H C X°° be the set of w 's such that Y n (w) converges 
to a finite limit. Suppose Assumption 4-1 holds and that P(H\xq) = 1. Then 
the set C is P(- \ xq) -recurrent. 

Proof. Let Eq be as defined above. Then by Lemma 4.1, Ec C H. 
Thus H = Ec U (H n E c ). To show that C is P{- \ x )-recurrent, it suffices 
to show P(H n E c | xq) = since then, P(fl"|x ) = ^(^cko) = 1- Let 



Then, since P(K\x ) = 1, we have P(fl" n E° c \ x ) = P(H nE c nK\x ). 
Since H n P^. fl K is empty, this completes the proof. □ 

The above shows that when Assumption 4.1 holds, P{-\xq) -recurrence of 
C will hold if we can show that {l^|n = 1, . . .} converges to a finite limit a.s.- 
P(-\xq). Of course, if {Y n } is a P(-\xo) supermartingale, then H has P{-\xq) 
probability one and Theorem 4.1 applies. It is this martingale argument that 
[17] used. 

4.2. When is {Y n } a supermartingale? Given the Markov chain X = 
(Xq,Xi ,.. .) with state space X and Xq = xq, let T n be the cr-algebra in £>°° 
generated by Xq, . . . , X n . Recall that {Y n ,T n \ n = 0, 1,2...} is a P(-\xq) 
supermartingale if 



Y ,Y U Y 2 ,... by 




n = 0,1,... 
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Definition 4.2. A function /: X — > [0, oo) is superharmonic on C c if 

(17) E(f(Xi) | X = x ) < f(x ) for all x G C c . 

Theorem 4.2. If f is superharmonic on C c , then (Y n ,T n | n = 0, 1,. . .) 
is a supermartingale for each xq G C c . 

Proof. Recall that Y { = f(Xi) for i = 0, 1 and fix x <£ C. That (16) 
holds for n = is a direct consequence of (17) since xq ^ C. 

For n > 1, let G n E ^ n be the event G n = {X x C, . . . , X n $ C) = {t c > 
n}. Obviously, E{Y n+l \ T n ) = E{I G cY n+l \ T n ) + E{I Gn Y n+1 | T n ). On the 
set G c n , Y n+ i = Y n so 

(18) E(I G c n Y n+1 | T n ) = E{I G cY n | T n ) = I G cY n . 
On the set G n , t g > n + 1 so Y n+ \ = f{X n+ i) and 

E(I Gn Y n+1 | T n ) = E(I G J(X n+1 ) | T n ) = I Gn E(f(Xn+l) I Tn) < I G J(X n ). 

The last inequality follows from the Markov property and the assumption 
that / is superharmonic on C c . Thus, on the set G n , 

(19) E(I Gn Y n+1 | T n ) < I G J(X n ) = I Gn Y n . 
Combining (18) and (19) shows (16) holds. □ 

Corollary 4.1. Suppose f is superharmonic on C c . Then for each 
xq G C c , {Y n } converges a.s.-P(- \ xq) to a finite random variable Y. 

Proof. This is just the Supermartingale Convergence Theorem since 
< EY n < EY\ for all n, so < sup n EY n < EY X ([3], page 468). □ 

Combining what has now been established, we have the following result. 

Theorem 4.3. Suppose Assumption 4-1 holds for xq G C c and that f is 
superharmonic on the set C c C X . Then the set C is recurrent. 

Theorem 4.3 parallels some results in Chapter 8 of [19] in the use of 
superharmonic functions. An important difference is that Meyn and Tweedie 
assume the Markov chain is irreducible, while we rely on Assumption 4.1. 
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4.3. The case when X = [0, oo) In this section it is assumed that the 
state space X is [0, oo). We develop sufficient conditions on the chain X = 
(Xq,Xx, . . .) so that a particular function / is superharmonic on the set 
[m, oo) when m is large enough. In addition, we provide a sufficient condition 
for Assumption 4.1 which may be easy to check in examples. When these two 
results hold, it will follow from our previous results that the set C = [0,m) 
is recurrent. An application of this result is provided in Section 5. 

We begin with a statement of a main result, although several auxiliary 
results are needed before the proof can be completed. For k = 1, 2, 3, let 



(20) Hk(x)= (y-x) k T(dy\x). 

Jo 

The quantity 

/•oo 

\y — x\ s T(dy\x) 



o 

is assumed finite for all x £ [0, oo), so Hk(x) is well defined for all x G [0,oo) 
and k = 1,2,3. We also assume that ^{x) > for all sufficiently large x. 

Theorem 4.4. Assume that there is a function ipi such that, for all 
sufficiently large x, 

(21) /ii(i)<^[l + ^i(i)] 
and 

(22) lim (logxW>i(x) =0. 

x — >oo 

Also, assume that 

(23) hm^^44 = 0. 

x^oo x ^2\X) 

Then, for x>0, 

(24) / (aO = lGg(log(e + s)) 

is superharmonic on the interval [m, oo) for m large enough. 

What needs to be established to prove Theorem 4.4 is that, for all large 

x, 

(25) 5(x) = £(/ (*i) - fo(x) \X = x)<0. 
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To this end, note that the first four derivatives of /o satisfy 

1 



>0, 



<0, 



(x + e) \og(x + e) 
log(x + e) + 1 
(x + e) 2 log 2 (x + e] 

21og 2 (x + e) + 31og(x + e) +2 
(x + e) 3 log 3 (x + e) 

61og 3 (x + e) + 71og 2 (x + e) + 12 log (a; + e) + 6 



>0, 



<0. 



(x + e) 4 log 4 (x + e) 

Now, expanding /o in a Taylor series about x in (25), discarding the negative 
term f^ in this expansion, and doing a bit of algebra results in 

fo(x)fJ, 2 (x) 



(26) 6(x) < 



2x 



2xin(x) fj{x)_ 2f(f'(x)fi 3 (x)x 

£f ( \ 



V2(x) f' {x) 6f^(x)/i 2 (x) 



For notational convenience, set 



2fd'(x)ii 3 (x)x 



§f'{x)H2{x) 

Our first intermediate conclusion is the following, which is a direct conse- 
quence of (26) and (21). 



(27) 



Lemma 4.2. For all sufficiently large x, with 5{x) as defined in (25), 



6{x) < 



2x 



Jo\ x ) 



Lemma 4.3. The right-hand side of the inequality (27) is equal to 



(28) m r (x) xf>(x) 



2x 



1 | elog(x + e) | j>i(x) i ip 2 (x) 



x 



xfo(x) xf' Q {x) 



(29) 



Proof. First verify by direct calculation that 
f Q {x) + xf^x) 



-x + elog(x + e). 



Multiplying and dividing the bracketed term in (27) by xf^(x) and using 
(29) immediately yields the claim. □ 

Now, we complete the proof of Theorem 4.4 by showing that the bracketed 
term in (28) is negative for all large x. But, this is a direct consequence of 
assumptions (22) and (23). For example, 

tpi(x) x + e 



xfo(x) 



x 



■\og(x + e)ipi(x), 
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which obviously converges to zero as x — > oo under assumption (22). With 
a bit more algebra, (23) implies that tp2(x) / x f'${x) converges to zero as 
x — > oo. Therefore, the right-hand side of (27) is negative for all large x. 
Thus, 5(x) < for all x large enough and the proof is complete. 

We now turn to a discussion of the structural Assumption 4.1 when X = 
[0,oo) and the function / is /o in (24). Because /o is monotone increasing 
from [0,oo) onto [0,oo), it is clear that Assumption 4.1 holds for f = fo if 
and only if the following holds: 



Condition 4.1. Given x G [0, oo), 

P lim sup X n = oo 



n—roo 



Xq ] = 1. 



Proposition 4.2. Define the event 

A m , k = {X x G [0, m], X 2 G [0, m) , . . . , X k € [0, m] } 

for positive integers m and k. Assume that, for each xq and each m £ N, 
P(A-m,k\ x o) ^0 as k — > oo. Then Condition 4-1 holds for all xq. 

PROOF. Fix x . For m G N, define E m = {X n G [0,m] for n = 1,2, .. .}. 
Let E = U^ = i E m . Clearly, P(E\x Q ) = if and only if P(E m \x ) = for all 
m G N. Define 

lim sup X n = oo 

and note that E c = F. Hence, P(F\xo) = 1 if and only P(E m \xo) = for 
all m G N. Note that A m ^ [ E m . By assumption, for each fixed m G N, 
P(A mj k\xo) — > as k — > oo. Thus, P(E m \xo) = for all m G N and the result 
is proved. □ 

Proposition 4.3. Assume that for each positive integer m there exists 
a 5 = 5(m) < 1 such that 

(30) sup T([0,m] | x ) < 5. 

xo£[Q,m] 

Then Condition 4-1 holds for all xq. 
PROOF. Fix xq and m G N. Note that 

P(A m>k \x )= / ••• / T(dx 1 \x )---T{dx k _ 1 \x k _ 2 )T([0,m]\x k _ 1 ) 
Jo Jo Jo 



< T([0,m] | x )8 
<5 k -\ 



k-l 
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Thus, P(A m ±\xo) — > as k — > oo and an application of Proposition 4.2 com- 
pletes the proof. □ 

In summary, the main conclusion of this section is the following. 

Theorem 4.5. Assume that X = [0, oo) and that the assumptions of 
Theorem 4-4 an d Proposition 4-3 hold. Then there exists anm£ (0, oo) such 
that the set C = [0, m) is recurrent. 

It is Theorem 4.5 that is used in the application of the next section. 

5. Strongly admissible priors for the multivariate normal mean. We now 

use our results to identify strongly admissible priors for the mean of a mul- 
tivariate normal distribution. Recall the setting of Example 1.1. Assume 
that X ~ N p (9,Ip) and take the prior to be v a ,b{d9) = d9/(a+ ||#|| 2 ) 6 , where 
a > 0, b > 0, and d9 is Lebesgue measure on W. 

The prior is improper for all b > 0, but the marginal is cr-finite 
only when b < p/2. On the other hand, when a > 0, u ab is improper only 
when b<p/2 and the marginal is cr-finite for all b in this range. We there- 
fore restrict attention to b £ (0,p/2) when a = and to b £ (0,p/2] when 
a > 0. When these conditions are satisfied, a proper posterior distribution 
Qa,b(d9\x) exists and satisfies the disintegration 

P(dx\9)u a>b (d9) = Q a , b (d9\x)M a , b (dx). 

Here is the main result of this section. 

Theorem 5.1. Suppose X ~ N p (9,I p ) with p>3. The prior v a ^ [equiv- 
alently, the posterior Q a ,b{d9\x)] is strongly admissible if either (A) a > 
andb£ [p/2 - 1, p/2] or (B) a = and b £ [p/2 - 1, p/2). 

Remark 5.1. The problem we address is substantively different from 
the traditional problem of estimating 9 under quadratic loss. However, one of 
the results established in [2] is that when a = 1 and b = (p — l)/2 the formal 
Bayes estimator of 9 is admissible under quadratic loss. The argument in [2] 
uses the general results in [4] . It seems plausible that the results in [4] could 
be used to obtain admissibility, under quadratic loss, of the formal Bayes 
estimator of 9 for any of the priors u a b in Theorem 5.1. 

Proof of Theorem 5.1. Throughout, we will refrain from using sub- 
scripts on i^, M and Q. In Example 3.1 it is shown that the prior v(d9) can 
be expressed as Tr(d9\f3)s(d(3), where 7r(- is the uniform distribution on 
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{<9|||6»|| 2 = 13} and s{d(3) is given in (12) with g ((3) = (a + j3)~ b . The new 
model, P, has density (with respect to Lebesgue measure on W) given by 

f(x\P) = ^(2^/^^)11-^11^), 

where tti is the uniform distribution on H = {#|||#|| = 1}. The formal poste- 
rior for (3 can be written as Q{d(3\x) = q([3\x)d(3, where 

pi) mx) = <hm S f)P'^ 

m(x) 

with c a positive constant and 

(32) m(x) = c f(x\f3)g (P)f3 p/2 - l d(3= f f(x\e)g (\\6\\ 2 )d6, 

JO JRP 

where f(x\9) is the multivariate normal density with mean 8 € W and co- 
variance matrix the px p identity, I p . The P-s chain has Markov transition 
function given by 

(33) R{df3\rf)= f Q(d(3\x)f{x\r])dx. 

Jrp 

According to Theorem 3.4, to prove that v is strongly admissible, it suffices 
to show that the P-s chain is 1-s-r. This is now established by showing that 
the conditions of Theorem 4.5 are satisfied. 

Note that f{x\rf) and q(f3\x) are strictly positive for all (3,7] £ (0,oo) and 
x £ ffi p . It follows that R(C\rj) > for every rj £ [0,oo) and every C with 
positive Lebesgue measure. In order to apply Proposition 4.3, we need to 
show that, for any m G N, there exists a 8 < 1 such that 

sup R([0,m]\r]) < 5. 

r]&[0,m] 

It suffices to show that R([0, m]\r]) is a continuous function of 77 for rj G [0, m}. 
Fix rf £ [0,m] and, without loss of generality, let {rjk} ( k = i be a sequence in 
[0,4m] that converges to rf . An application of dominated convergence shows 
that f(x\rjk) — > f(x\r]*) as k — > 00. Let g :MP — > E such that 

g(x)=I(\\x\\ <A^) + I(\\x\\ >4Vm>- (1/2)M2 . 

Since g is integrable and f(x\rjk) <g{x) for all x and all /c, another applica- 
tion of dominated convergence yields the desired continuity. 

We now turn our attention to establishing the conditions of Theorem 4.4. 
Recall that 

Mfcfa):=/ (P-ri) k R(dP\v)- 
Jo 
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In the appendix we prove 



Hi(rj) = 2p-4b + ip*(rf), /x 2 0?) = 877 + i>\ (77) and ^3(77) = ^3(77), 

where, as 77 — > 00, (77) = 0(r/ _1 ), VK 7 ?) = ^(1) and ip$(rj) = O^)- Set 
t/>i(t?) = rj~ e for some e € (0, 1). Then (22) holds and 



so (21) holds. Furthermore, (23) holds since, as 77— >oo, 
log 77^3(77) _ log 77 7/^(77) _ ^(77) log 77 

77 ^2(7?) »7 877 + ^2(7?) 77 877 + 7^2(77) n 

Let v{d9) be an improper prior on = W and suppose that the P-v 
chain is \-v-r so that v is strongly admissible. A result in [11] shows that 
(under mild conditions) if h:W — > [0,oo) is a bounded function, then the 
"perturbed" prior v*(d6) = h(9)v{d6) is also strongly admissible. In fact, 
Corollary 4 in [11] in conjunction with the results in the proof of our Theo- 
rem 5.1 immediately yields the following: 

Theorem 5.2. Suppose X ~ N p (6,I p ) with p>3. Let h:W -> [0,oo) 
be bounded and suppose the perturbed prior Vh,a,b{d0) = h{6)v a ^{d9) is im- 
proper. Then fh,a,b is strongly admissible if either (A) a > and b G [p/2 — 
l,p/2] or (B) a = and be [p/2 - 1, p/2). 

Remark 5.2. It is possible to get most of part (A) of Theorem 5.1 by 
combining the proof of part (B) with the work in [11]. Fix b £ [p/2 — 1, p/2). 
We know from the proof of part (B) that the Markov chain associated with 
the prior vo,b{d8) = d0/(\\8\\ 2 ) b is l-^o,fe _r - Now fix a > 0, and note that 



Since (||#|| 2 /(a+ ||#|| 2 )) b is a bounded function, the results in [11] imply that 
v a f> is strongly admissible. 

Finally, using arguments somewhat similar to those in the proof of The- 
orem 5.1, the multivariate Poisson case was discussed in [16]. 



/j 2 (77)[l + ^(77)] = 877 + Sr? 1 ^ + O(l). 



Since b>p/2 — 1, it follows that 



2777/1(77) = (4p - 86)77 + 0(1) < 877 + O(l) 
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Throughout this section, pG {3,4,...}, a > and b > are considered 
fixed and the assumptions of Theorem 5.1 are in force. We begin by recalling 
some facts concerning the noncentral x 2 distribution. Suppose that Z ~ 
N p (j,I p ) and let A= ||7|| 2 . Then V = \\Z\\ 2 ~Xp(A) and its density at v > 
is given by 



(34) 



E 



' x / 2 (X/2) n (v/2) n+ P/ 2 - 1 e- v / 2 



n=0 



n! 2T(n + p/2) 

We will need the first three moments of V, which are as follows: 

E[V\X] = A + p, E[V 2 \X] = (X+p) 2 + 4A + 2p, 

E[V 3 \X] = (A + p) 3 + 12(A +pf - 6Xp + 24A - 6p 2 + 8p. 

Recall that g (z) = (a + z)~ b . Using (34), one can show that E[g Q (V)V k \X) 
2 k E[w k (N)\X], where A|A ~ Poisson(A/2) and, for n G Z + := {0, 1, 2, . . .}, 

f°° go{z)(z/2) n+ P/ 2+k ~ l e- z / 2 
w k {n) := — 
Jo 

When a = and p/2 - b > 0, 



(35) 



w k (n) 



2T(n + p/2) 
T(n + p/2 + k-b) 



■ dz. 



2 b T{n + p/2) ' 

which is well defined even when n = k = 0. Our goal is to prove the following. 



Proposition A.l. For rj > 0, we have 

fii(r]) = 2p-Ab + tpl(r]), // 2 (j)) = 8 V + ^2 (v) and ^(v) = ^Kv), 
where, as rj —> oo, tpK 7 ]) = 0(n -1 ), tp^iv) = 0(1) and i/j^(t]) = 0(rj). 

Proof. The proof is constructed in several intermediate steps. Define 
m k (() : R + -> R + via m fc (C) = £[3o( W fc |C] with U ~ Xp(C)- Our first result 
follows. 

Proposition A. 2. Frc 77 > and iei F ~ X 2 {v)- For k G {0, 1,2,3}, 

m fc (F) 



(36) 
and 
(37) 



E 



m (Y) 



< oo 



[3 k R{d[3\r,) = E 



m k (Y) 



mo(Y) 
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Proof. We begin with (36). Since go is convex, we can use Jensen's 
inequality to obtain 

m k (() m fe (C) , , b 

77Y < 77 , \ = (a + C + P) m k (Q. 

mo{() go((+P) 

Hence, it suffices to show that E[Y b m k (Y) \ rf\ < 00, but this follows from 
the fact that Y has a moment generating function. 

We now establish (37). Recall that n\ is the uniform distribution on S = 
= 1}. Then 

■ 1 

V | = (2^)^/ 2 e-( 1 / 2 )ll^^ll\ 1 (^) 5o ( / 3)/?p/2-i d/3 

50 (||^|| 2 )(||e|| 2 ) fc (2^)- p/2 e-( 1 / 2 )!l :c - e ll 2 ^. 
The last expression is equal to 

E[g (\\e\\ 2 )(\\o\\ 2 )% 



where 6 ~ N p (x, I p ). Of course, if 6 ~ N p (x, I p ), then ||6>|| 2 ~ XpGM 



(5 k ~f{x\fi)g Q (f3)(3 p/2 - 1 dl3 = m k (\\x\\ 2 ). 

^f3 k f{x\P)gy(f3)(3Pl 2 - l d{3 _ m k {\\xf) 



2 ). Thus, 



Now note that 



(3 h Q{d(3\x 



Jo°° f(x\(3)g Q ((3)(3P/^d(3 m (IMI 2 )' 
To complete the proof, note that 

|2\ 



0*R(d0\ri) 



f3 k Q(d(3\x) 



f(x\rj) dx 



m k {\\x\\ 
rp m (||x|| 2 ) 



f(x\r))dx 



m k (\\x\\ 2 ) 
rp J~ m (||x|| 2 ) 



(-^Je-^/^-^f^dOdx 



m k l x 



3 [Jw moduli 2 ) \\/2tt 



<m\\x-vvz\\ 2 dx 



Tri(de). 



Consider the inside integral. If X ~ N p (y/rj£,I p ), then, since Hy^H 2 = ??, it 

m fc (y) 



follows that ||X|| 2 ~ Xp(^)- Hence, the inside integral can be expressed as 



E 



m (Y) 



EVALUATION OF POSTERIOR DISTRIBUTIONS 



21 



where Y ~ xl,{i]), an d this does not depend on £. Therefore, 



P k R(d(3\v) 



m k (Y) 



m (Y) 



ir(d^) = £ 



m fc (y) 



mo 00 



□ 



It now follows from the discussion at the beginning of this appendix that 
m k {y) = 2 k E[w k (N)\y], where N\y ~ Poisson(y/2) and 



H2(ti)=4E 
fi 3 (v)=8E 



E(wi(N) 



E(w (N) 
E(w 2 (N) 



E(w (N) 
E(w 3 (N) 



E(w (N) 



Y) 



Y) 
Y) 



Y) 
Y) 



Y) 



AnE 
12nE 



E( Wl (N)\Y) 



E(w (N)\Y) 
E(w 2 (N)\Y) 



E(w (N)\Y) 



+ 6rj 2 E 



E( Wl (N)\Y) 



E(w (N)\Y) 



V 



where N\Y = y ~ Poisson(y/2) and Y ~ Xp(??)- 

We now begin working on Wk(n). Our first result is as follows. 

Proposition A. 3. SupposeneZ + and h e {1,2,3}. 
1. If a = 0, then 



(38) 



r -r=n+r + fc-6-l. 

w k -\{n) 2 



(39) 



2. // a > 0, then 
w k (n) 



n + ^ + k - b - 1 + cj){n + k - 1), 
Wfc_i(n) 2 



where <fi : Z + — > (0,oo) is given by 

SS°(ab/(a + z))g (z)(z/2)^/ 2 - 1 e -/ 2 rfz 
9{ >' J^g (z)(z/2) n +P/ 2 - 1 e- z / 2 dz 

Moreover, 4>{n) is bounded and 4>(n) = 0{n~ l ) osn-> oo. 

Proof. The a = result follows directly from (35). Now assume that 
a > and define k(z) = ab/(a + z). Note that 



(40) 
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where g' denotes the derivative of g$. Integration by parts yields 

i poo / ~\ n+p/2+k— 1 

Mn) = rf , / %{z)e- z/2 (-) dz 



T(n + p/2) Jo 

+ (n + | + fc- 1 )w k -i(n). 



An application of (40) yields 

i / „\ n+p/2+fc-l 

3o(*K z/2 (f) cfa 

1 /-oo /r \n+p/2+fc-2 



2 



6 + K ( z )] 5o (z) e -/2(Jj ^ 



. 6r ^n + |J«; fc _ 1 (n) + -jf ) K (z) 5o (z) e - z / 2 ^-J dz. 



It follows that 



tc fc (n) _ n + P +k _ b _ l + (j){n + k _ l y 



w k -i(n) 2 

Since «(z) is bounded above by b, it is clear that 4>{n) is also bounded above 
by b. Now define N(n) and D{n) as follows: 

_ N(n) _ j™{ab/{a + z))g {z){z/2Y+vl^e-^dz 
nU) D{n) J Q x g {z)(z/2) n +P/ 2 - 1 e- z / 2 dz 

Using Jensen's inequality, we obtain 

/•oo / sn+p/2-1 

D(n) = J o go(z)(~) e- z ' 2 dz 

roo z n+p/2—X e —z/2 
= *T<» +!-/*)/ 9^) ,^ 



2Y{n + p/2) 
~ (a + p + 2n) b ' 

As long as n > b + 1 — p/2, we have 



< °*L = r z n+ vl 2 - b - 2 e- z l 2 dz 



2 n+p/2-\ J Q 

abT(n + p/2-b-l) 
2^ 
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Putting these two bounds together, we have 

ab fa + p \ b h+l T{n+p/2-b-l) 
nd>(n) < — 7— r \- 2 ) n 



2 b + 1 \ n J T(n + p/2) 

Finally, according to [1], page 257, 

b+1 T(n + p/2-b-l) 

n ^ — — > 1 as n — > oo, 

T(n + p/2) 

from which it follows that 4>(n) = 0(n~ l ). □ 

A simple calculation shows that, for n G Z + and k £ {1, 2, 3}, we have 
w k (n) = + 0u; fe _ 1 (n+ 1). 
This combined with (39) yields, when a > 0, 

(41) (n + p/2)u; fc _ 1 (ra + 1) = (n +p/2 + k - b - 1 + <j)(n + k - l))w k -i(n). 

Proposition A. 4. Suppose N\y ~ Poisson(y/2) where y > and let 
fc€{l,2,3}. 

1. If a = 0, then 

E[w k (N)\y] 



E[w k ^(N)\y] 

_P. b h y y{k-b-l)E[w k . 1 {N)/i < N + pl2)\y] 

1 1 2 E[w k ^(N)\y] 

2. If a > 0, i/ien 

E[w k (N)\y] 
£[u/ fc _i(JV)|y] 

= g , , 1 y j/(fc-6-i),gK-i(^)/(^+p/2)|y] 

2 + + 2 + 2 £?[«; fc _i(JV)|y] 
y E[<j)(N + k — l)w k ^(N)/(N+p/2)\y] 



(43) 



+ 



+ 



2 E[w k ^(N)\y] 
E^N + k - l)w k ^(N)\y] 



E[ Wk ^{N)\y] 
Proof. First, 

,e~y/ 2 (y/2) n 



E[Nw k ^(N)\y} = £ nw fc _i(n)- 



n=0 



J?! 
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-f/2(y/ 2 )n-l 



y sr f \' 
2 L^-h")' 



| 51 u>*-i(m + l) 



n=l 

oo 



(n- 1)! 

e^ /2 (y/2) 



m=0 



???.! 



= |£;[ti; fc _ 1 (JV+l)|y]. 

We now prove the result for a > and we note that a proof for the a = case 
can be constructed simply by replacing (ft by in the following argument. 
Rearranging (39) and taking expectations yields 



E[w k (N)\y] 



6-1 £K_i(iV)|y] 



(44) 

+ £7[iV«; fc _i(JV)|y] + El^N + k- l)w k -i(N)\y] 
A rearrangement of (41) yields 

2(k-b-l + cf)(n + k-l)) 



w k _i{n + 1) = w k _i(n) + io fc _i(n) 
Therefore, 



2n + p 



(45) 



£7[JV«;fc_i(JV)|y] = |£K_i(iV + 1) 



(k — b — l + (j>(N + k— l))w k ^(N) 



2N + p 

Replacing E[Nw k -i(N)\y] in (44) with the last line in (45) and dividing 
through by E[w k _i(N)\y] yields (43). □ 

Proposition A. 5. There exists a constant cS (0, oo) such that 



Wfe-i(n + l) 



<c 



for all n £ Z + and a// E {1,2,3}. Note that c may depend on p, a and b, 
but does not depend on n and k. 

Proof. We first handle the case a = 0. Equation (35) shows that 

Wk-i(n) _ n + p/2 
u) k -i(n + l) n + p/2 + k — b—l 

Since b<p/2, the denominator is always strictly positive. Furthermore, for 
any k € {1,2,3}, this fraction clearly converges to 1 as n — ► oo. Thus, the 
fraction is bounded above by a positive, finite constant. 
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The a > case is similar, but we do not have the luxury of using the exact 
expression (35). A rearrangement of (41) yields 

u>fc_i(n) n + p/2 



u>jfc_i(n+l) n + p/2 + k - b- 1 + <p(n + k - 1) 

Since 0(n) is strictly positive, the denominator is always strictly positive. 
As above, for any k G {1,2, 3}, the fraction converges to 1 as n — ► oo. Thus, 
the fraction is bounded above by a positive, finite constant. □ 

Note that for all n G Z + and all fc G {1, 2, 3}, we have 

Wfc-i(ra) _ 'Wfc-i(n) ■w fc _ 1 (n+ 1) ^2 



w fc _i(n + 2) u7 fc _i(n + l)^ fc _i(n + 2) 

Proposition A. 6. Suppose h:Z + —> R is suc/i i/iai /i(n) = 0{n~ l ) as 
n — > oo, where I G {1,2}. 7/ iV|y ~ Poisson{y /2) where y > 0, then there 
exists < d < oo such that for any k G {1, 2, 3}, 

£7[fc(A0it; fc _i(JV)|y] 



£7[zy fc _i(iV)|2/] 



<2 



Proof. We prove the result for 1 = 2. The proof for the I = 1 case is 
similar and is left to the reader. From Proposition A. 5, we know there exists 
a constant c G (0, oo) such that c 2 Wk~i{n + 2) > w^i{n) for all n G Z + and 
all k G {1,2,3}. The assumptions concerning h imply the existence of an 
M G (0,oo) such that \(n + 2)(n + l)h(n)\ < M for all n G Z+. Thus, 

|£ , [/i(A)w/ _i(A r )|y]| 

<E[|/i(JV)|«;*-i(JV)|j/] 

e-^ 2 (y/2) n 



= E IM n )K-i(n)- 

n=0 n - 

4 ^ e~y/ 2 (v/2) n+2 
= -lE(« + 2)(n + l)\Kn)\ Wk ^{n) J' j , 

4r 2 ^ e _ 2// 2 lW2 N > n+2 

< Tj- £(» + 2 )(^ + 1) w»)N-i(» + 2) (n 7 2 ;, 

4c 2 M ™ , ,e-^ 2 (y/2f+ 2 
V 2 (n + 2)! 

4c 2 M ~ N e-^ 2 (y/2) m 

< — — 2^ w fe-i( m ) 



' — ' ml 

a m=U 
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4c 2 M 

^E[w k ^{N)\y\. 

y 



□ 



Proposition A. 7. Suppose N\y ~ Poisson(y/2) where y > 0. For each 
k 6 {1,2,3}, there exists a bounded function ip k and a constant d k 6 (0,oo) 
such that \ijjk(y)\ <d k /y for y > and 

E[w k (N)\y] P , ofu , n.f 
E K _ 1 (iV)| y ] = 2 +2(fc - 6 - 1) + 2 + ^ ) - 

Proof. Suppose a > and fix fc 6 {1,2,3}. The a = case is simpler 
and is left to the reader. Rearranging (41) and taking expectations yields 



E 



Wk-i(N) 



N + p/2 
= E 



w k -i(N + l) 



(46) 



N + p/2 + k-b-l + (f>(N + k-l) 



y n=0 



n + 1 



n + p/2 + k — b — l + 4>(n + k - 1) 



m=l 



m 



-y/ 2 (y/2) n+1 
(n+1)! 

e"^ 2 (y/2) m 



^^wHtl + Ofm- 1 )] 



m + p/2 + &; - - 2 + 0(m + k - 2) 
e-f/ 2 (y/2) m 



77?! 



m=0 



777! 



y y 



Therefore, 



(47) 



y(k-b- 1) E[w k ^(N)/(N + p/2)\y] 



k-b-l + (k-b-l 



£[iu fc -i(i\0li/] 

^K-iC^ocjv- 1 ; 



£K_!(A0|y] 



This allows us to rewrite (43) as 

F ?" t( ^, - f + !«*-»- 1) + * +*.(») + W.) + *.<»). 
El«;fe_i(iV)|yJ 2 2 
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where 



' E[w k ^(N)\y) 



y E[cp{N + k - l)w k ^{N)/{N + p/2)\y} 

A2iy) := 2 E^mv] ' 



E[</>(N + k- l)w k -i(N)\y] 



E[w k ^(N)\y] 

Let ipk(y) = Sj=i i^kjiy)- Consider if) k \{y). Proposition A. 6 implies that 
there exists a d* kl G (0, oo) such that 

E[w k ^(N)0{N- x ) 



E[w k ^(N)\y] 



< 



which implies that 



Wki(v)\ < 



life -6 



1 \rl* 
L \ a kl 



y 



4i 
y 



This shows that ip k \ is bounded for large y. Note that (47) yields 

y (k-b- 1) #K_!(iV)/(iV + p/2)|y] 



IV'fci^)! 



<\b+l-k\+y 



2 

jfe-6- 



E[«; fc _i(JV)|y] 



1 



P 



which shows that ip k i is bounded for small y as well. 

Now consider i/j k 2- Since (ft(n + k — l)/(n+p/2) is 0(n~ 2 ), Proposition A. 6 
implies that there exists a d k2 G (0, oo) such that, for all y > 0, 



£[0(iV + fc - Vwk-tW/iN + P /2)\y] 



and hence, 



£K_i(A0|y] 



hM*)l<£% 
2 y l 



< 



d 



kg 

2 ■ 



dk2 

y 



As above, this shows that tp k 2 is bounded for large y. The fact that V>fc2 is 
bounded for small y follows from the fact that cp is bounded. 

Finally, consider Since (j){n) = 0{n~ l ), Proposition A. 6 implies that 
there exists a d£ 3 G (0, oo) such that, for all y>0, \ipks(y)\ < d k $/y. Again, 
the boundedness of Vto follows from that of 4>. Putting all of this together, 
we find that 



\My)\ 



^Z^kj{y) 



<£My(i/)l<3 



max{4i, 4 2 ,43} 
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Moreover, since each \ipkj(y)\ is bounded for y > 0, so is \ipk{y)\- 



□ 



We can now assert that 
E[wi(N) 



E[w Q (N) 
E[w 2 {N) 



E[wi(N) 
E[w 3 (N) 



E[w 2 (N) 



V] =--2b+- + My), 

g = f+2(l-6) + | + ^(y), 

M = | + 2(2-6) + | + ^(y), 



where each tpi(y) is bounded and \ipi(y)\ < d/y for all y > and a constant 
cZ £ (0, oo). We need one more technical result. 

Proposition A. 8. Suppose that Y\rj ~ X^iv) and that p>3. If there 
exists a < d < oo such that \tp(y)\ < d/y, then \E[iJj(Y)\r]]\ < 2d/r] so that 
E[ip(Y)\r]] = 0(t]~ 1 ) as rj — > oo. 



Proof. 

\E^{Y)\rj\\< 



MV)\ E :L . _ L dy 



n=0 



111 



oo 

n=0 



-v/2(rj/2) n 



)V. 



y 



2T{n + p/2) 

n+p/2~2 p -y/2 

dy 



2 n +P/ 2 T{n + p/2) 



- V /2 



fa/2)' 



)V. 



(2n + p-2) 



n=0 

2d ~ 

r?^ (2n+p-2) (n + 1) 



(n + 1) 



-tj/2 



fa/2) 



n+1 



< 2dg 

^ n=0 



V 2 (77/2) n+1 2d 



□ 



Assume that Y\q ~ Xpfa)- Proposition A. 8 implies that ^^(Y^Hn] = 
0(?] _1 ) as t] — ► oo. Furthermore, since \yipi(y)\ < d, E[\Yi/ji(Y)\\rj\ = O(l) 
as 7] — > oo. Also, since the ^ are bounded, so are products of the i^i. 
Thus, £[|^(Y>i(Y)N and E[\i/}i(Y)i/} j (Y)il; k (Y)\\rj\ are both 0(1) as r? 
oo for any i,j,k £ {1,2,3}. Finally, since \y 2 ipi(y)\ < dy, it follows that 
£;[y 2 |-0i(y)||r?] <dE[y|7?] = d(n+p) = 0(n) as ry^oo. 

Putting the above work together (and using the moments of the noncen- 
tral x 2 given at the beginning of this section), we calculate that 

-E( W1 (N)\Y) 



E 



E(w (N)\Y) 



+ p -2b + 0(r] 
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(in all of these equations, it is understood that the limits are taken as r/ — > oo) 



E 



E{w 2 {N)\Y) 



E 



E(w (N)\Y) 



E(w 3 (N)\Y) 



E 



E(w 2 (N)\Y) E( Wl (N)\Y) 



E( Wl (N)\Y) E(w (N)\Y) 



^ + (p + 2-2% + 0(l), 



E(w Q (N)\Y) 



E 
1 



E(w 3 (N)\Y) E(w 2 (N)\Y) E( Wl (N)\Y) 



E(w 2 (N)\Y) E(wi(N)\Y) E(w (N)\Y) 

-(r? 3 + (6p- 126 + 24)r ? 2 ) +0{rf). 
8 

We are now in position to calculate Mi (77), H 2 {rj) and ^3(77) in (38): 
-E( Wl {N)\Y) 



H 1 (r l ) = 2E 



fi 2 ( V ) = 4E 



E(w (N)\Y) 
r} + 2p-4b + 0(r]~ 1 ) -rj 
2p-Ab + 0{7 1 - 1 ), 
E{w 2 {N)\Y) 



E(w (N)\Y) 



Ar]E 



E( Wl (N)\Y) 



E(w (N)\Y) 



+ 7T 



= 4^ + (p + 2-26)^ -4 v (^+p-2bj +7] 2 + 0(l) 
= 8 ?? + 0(l), 



lE(w 3 (N)\Y) 




- Ylj]E 


IE(w 2 (N)\Y) 




[E(w (N)\Y) 


V 


[E(wo(N)\Y) 


V 



+ 6t] 2 E 



E( Wl (N)\Y) 



E(w (N)\Y) 



(r? 3 + (6p - 12b + 24)r? 2 ) - I2r}[ — + (p + 2 - 26)77 



2 1 



+ 677 V 2 + p _ 26 y _r? +°( ? ?) = ( ? ?)- 

This completes the proof of Proposition A.l. □ 
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